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ABSTRACT 

The early lytic phase of Kaposi's sarcoma herpes- 
virus infection is characterized by viral replication 
and the global degradation (shutoff) of host mRNA. 
Key to both activities is the virally encoded alkaline 
exonuclease KSHV SOX. While the DNase activity of 
KSHV SOX is required for the resolution of viral 
genomic DNA as a precursor to encapsidation, its 
exact involvement in host shutoff remains to be 
determined. We present the first crystal structure 
of a KSHV SOX-DNA complex that has illuminated 
the catalytic mechanism underpinning both its endo 
and exonuclease activities. We further illustrate that 
KSHV SOX, similar to its Epstein-Barr virus homo- 
logue, has an intrinsic RNase activity in vitro that 
although an element of host shutoff, cannot solely 
account for the phenomenon. 

INTRODUCTION 

Kaposi's sarcoma associated herpesvirus (KSHV, also 
known as Human Herpes Virus-8), is a member of the 
y-herpesvirus family of double-stranded DNA viruses. 
KSHV infects both B and endothelial cells where it is 
the main causative agent of Kaposi's sarcoma (KS) with 
epidemic or AIDS-related KS being the most common 
cancer associated with HIV infection (1). In addition to 
KS, KSHV has also been linked to a number of lymp- 
hoproliferative disorders that include primary effusion 
lymphoma and Multicentric Castleman's disease (2-4). 
While KS can be successfully treated in the early stages 
of HIV infection by anti-retroviral drugs, there are no 
effective treatments for the advanced disease or 
KSHV-associated disorders. 



The KSHV life cycle, similar to all herpesviruses, can be 
divided into a latent and lytic phase. In the majority of 
infected cells, the virus remains in a latent state that is 
characterized by a highly restricted pattern of gene expres- 
sion. This contrasts with the lytic phase that is dominated 
by the over expression of genes required for host cell lysis 
and viral replication. During the initial stages of the 
lytic phase, a highly conserved alkaline exonuclease is 
expressed that is encoded by ORF 37 in KSHV. The 
precise function of the viral alkaline exonucleases in the 
context of viral genomic DNA replication is unclear; 
however, evidence [mostly obtained from Herpes simplex 
virus- 1 (HSV-1)] suggests that they are involved in the 
processing of viral genomic DNA prior to encapsidation 
and facilitate the egress of viral particles from the nucleus 
to the cytoplasm (5-7). Consistent with this, three-way 
viral genomic DNA junctions have been observed during 
HSV infection (8,9). In the absence of alkaline exonuclease 
activity, these junctions accumulate in the nucleus result- 
ing in a marked reduction in the number of mature viral 
particles released from infected cells (5). In support of this 
'resolvase' role, the viral alkaline exonucleases exhibit 
Mg 2+ or Mn 2+ dependent 5 -3' exonuclease activities 
in vitro (in addition to a less prevalent endonuclease 
activity) processing both double- and single-stranded 
DNA substrates (10-13). Owing to the presence of a 
PD-(D/E)XK motif, the viral alkaline exonucleases have 
been classified as members of the type II restriction 
endonuclease-like superfamily (14), which has been con- 
firmed by the recent crystal structures of the native KSHV 
enzyme and its Epstein-Barr virus (EBV) homologue 
BGLF5 (15,16). Both reveal conservation of motifs key 
to DNase activity (7) that were first observed in the re- 
striction endonucleases EcoRI, BamHI and Hindi (17- 
19), together with a 'bridge' spanning the N- and 
C-terminal lobes that is also present in bacteriophage A 



*To whom correspondence should be addressed. Tel: 0207 631 6822; Fax: 0207 631 6803; Email: t.barrett@mail.cryst.bbk.ac.uk 
© The Author(s) 2011. Published by Oxford University Press. 

This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/ 
by-nc/2.5), which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited. 



Nucleic Acids Research, 2011, Vol. 39, No. 13 5745 



exonuclease (20). Bridge or arch structures are observed in 
a large variety of nucleases that in addition to type II 
restriction enzymes include members of the flap endo- 
nuclease family where they are thought to function in sub- 
strate recognition (21,22). 

Onset of the lytic phase in the herpesviruses not only 
coincides with viral replication, but also the rapid and 
global degradation of host mRNA. This is thought to be 
one of several mechanisms utilized by these viruses to 
evade immune detection (23). A key protagonist of host 
shutoff in the y-herpesvirus subgroup is the alkaline exo- 
nuclease that has been renamed SOX (shutoff and exo- 
nuclease) in recognition of its dual role. While host 
shutoff has also been observed in HSV infection and ex- 
tensively studied in HSV-1, it appears to be mechanistic- 
ally dissimilar to that of the y-herpesviruses owing to the 
requirement for the UL 41 -encoded viral host shutoff 
protein (vhs) that actively degrades mRNA and is unre- 
lated to the SOX homologues (24). Although the exact 
mechanism in the y-herpesviruses remains to be fully 
elucidated, the analysis of various SOX mutants has 
shown that the DNA processing and shutoff activities 
are genetically separable and occur in different cellular 
compartments (25). Until recently, it was thought that 
the global degradation of host mRNA could not be 
explained by an intrinsic RNase activity of the SOX 
homologues, since none had been detected in vitro. 
Alternatively, it was proposed to originate from subver- 
sion of a host cellular pathway involving the erroneous 
activation or targeting of proteins participating in 
mRNA degradation pathways (26,27). A Mn 2+ -dependent 
RNase activity has, however, been recently characterized 
for BGLF5, where cleavage of both RNA and DNA are 
dependent on the same catalytic machinery (16). Since the 
available crystal structures show that the majority of 
non-catalytic residues implicated in host shutoff map to 
the N- terminal region of the molecule, and are therefore 
distant from the catalytic centre, it has been speculated 
that recognition of RNA and DNA substrates might 
require different structural motifs (16). To probe the mo- 
lecular basis underlying the DNase activities of KSHV 
SOX, we have determined the crystal structure of a 
KSHV SOX-DNA complex involving a linear duplex. 
This first structure of an alkaline exonuclease complex 
provides important insights into the catalytic mechanism 
utilized by these enzymes and suggests that the bridge 
functions in targeting substrates for endo or exo- 
nucleolytic cleavage. We further demonstrate that similar 
to BGLF5, KSHV SOX also has an intrinsic RNase 
activity, however, site-directed mutagenesis studies reveal 
that it is unlikely to be uniquely responsible for host 
shutoff. 



MATERIALS AND METHODS 

Cloning and expression 

The gene-encoding KSHV SOX (GenBank protein ID: 
AA62609) was re-engineered to optimize codon usage 
for expression in Escherichia coli. The codon-optimized 
SOX gene was then PCR amplified using the forward 



and reverse primers CCATGGAAGCCACCCCAACG 
and TCTAGA CTCGAG TTACGGGGAATGCGGCAC 
, incorporating Ncol and Xhol restriction sites, respect- 
ively (underlined). The resulting PCR product was then 
sub cloned into a pCR®-Blunt II- TOPO® vector 
(Invitrogen) and following excision with Ncol and Xhol, 
the insert cloned into pETM6Tl (28) that had been pre- 
viously digested using the same enzymes. This produced a 
6His-NusA-6His-(TEV site)-fusion protein that after 
cleavage with TEV protease had the sequence GA 
N- terminal to the first amino acid. pETM6Tl-SOX was 
co-transformed into BL21(DE3) Star M (Invitrogen) with 
a plasmid encoding the rare tRNAs extracted from 
Rosetta™ 2(DE3) cells (Novagen). The cells were 
inoculated into 100 ml of 2YT media [1.6% (w/v) bacto- 
tryptone (Invitrogen), 1 % bacto-yeast extract (Invitrogen) 
and 0.5% NaCl, adjusted to pH 7.2] containing chloram- 
phenicol (34 (ig/ml) and kanamycin (25 |ig/ml) and grown 
at 37°C overnight. A 1/100 dilution of overnight culture 
was then inoculated into 2YT medium and grown to an 
A 6Q q of 1.0 at 37°C before induction with 1 mM isopropyl 
l-thio-(3-D-galactopyranoside and incubated overnight at 
18°C. The cells were then harvested by centrifuga- 
tion washed with buffer A (200 mM NaCl, 25 mM Tris 
pH 8.5) and stored at -80°C. 

Protein purification and mutagenesis 

Cell extracts were prepared by re-suspending the pellets in 
buffer A supplemented with DNase 1(10 |ig/ml final con- 
centration) and an EDTA-free protease inhibitor cocktail 
tablet (Roche). After sonication on ice, the lysates were 
clarified by centrifugation (46 000g for 1 h at 4°C) and 
the supernatant filtered through a 0.45 urn filter prior 
to loading onto a 5 ml HisTrap™ FF column (GE- 
Healthcare). The column was washed with 20 column 
volumes (CVs) of buffer A containing 50 mM imidazole 
and the protein eluted using 5 CVs of buffer B (buffer A 
supplemented with 500 mM imidazole). The eluate was 
directly loaded onto a 5 ml HiTrap M Q FF column 
(GE-Healthcare) previously equilibrated in 25 mM Tris, 
50 mM NaCl, pH 8.5. The fusion protein was then 
eluted using a 50-1000 mM NaCl linear gradient and the 
tag removed by the addition of TEV protease followed by 
overnight dialysis in a buffer comprising 25 mM Tris 
pH 8.5, 200 mM NaCl, 1 mM DTT. The solution was 
then applied to a 5 ml HiTrap™ Q FF column (pre- 
equilibrated in the same buffer) and the un-tagged 
protein eluted using a 50-500 mM NaCl linear gradient. 
Fractions containing the purest protein were pooled and 
concentrated using a lOkDa cut-off Vivaspin centrifugal 
concentrator (Vivascience). The protein was then loaded 
onto a gel filtration column (Superdex75 HR 10/60) 
that had been pre-equilibrated in a buffer consisting of 
25 mM Tris pH8.5, 300 mM NaCl, 10% glycerol. 
Fractions were analysed on 15% SDS-PAGE gels and 
those containing protein of the highest purity 
concentrated to 4mg/ml using a lOkDa cut-off spin cart- 
ridge (VivaScience), aliquoted and stored at — 80°C. All 
mutants were generated using the QuikChange™ muta- 
genesis kit (Agilent Technologies) and were expressed 
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using the same protocol described for the wild-type 
protein. 

Oligonucleotides 

The oligonucleotides used in the assays and for crystalliza- 
tion are shown in Table 1 . 6-FAM and FL refer to those 3' 
substituted with the fluorophores 6-FAM and fluorescein. 
P-5' refers to oligonucleotides labelled with a 5' phosphate 
group. All duplexes were initially dissolved in water and 
heat annealed. 

Cleavage assays 

DNase and RNase assays were executed according to 
Buisson et al. (16) with some modifications. Exonuclease 
assays were performed using plasmid and linear duplex 
substrates. For those involving plasmids, 20pmol of 
pETM6Tl vector linearized using EcoRI, were incubated 
with 150pmol of KSHV SOX (or SOX mutants) for 1 h in 
a buffer comprising 50 mM Tris-HCl pH 9.0, 200 mM 
NaCl, 20 mM [3-mercaptoethanol ((3ME), lOmM MgCl 2 . 
For the linear substrates, 20pmol of dsDNA or dsDNA- 
5'P (Table 1) were incubated with 150pmol of KSHV 
SOX in the buffer used for the plasmid studies. The 
DNA substrates were substituted for 20pmol of ssRNA- 
5'P in the RNase assay (Table 1), where reactions were 
performed using the incubation buffer described for 
BGLF5 (16). The DNA and RNA reactions were halted 
by the addition of 50 mM EDTA and 7.5 ul of each reac- 
tion mixture were combined with 7.5 ul of No vex TBE 
urea sample buffer (Invitrogen) prior to loading onto 
15% TBE-urea gels (Invitrogen). 

Fluorescence anisotropy 

Fluorescence anisotropy assays were conducted at 25°C 
using a Fluoromax-3 spectrofluorimeter (Jobin Yvon 
Horiba), and the data fitted using GraFit (Erithacus 
Software). Serial dilutions of SOX were incubated for 
30min with 50 nM of dsDNA, dsDNA-5'P, dsRNA-5'P 
or ssRNA-5'P (Table 1) in a binding buffer comprising 
25 mM Tris-HCl pH 8.5, 200 mM NaCl and 10% 
glycerol in a final volume of 60 ul. These were then trans- 
ferred to 60 ul quartz cuvettes for anisotropy measure- 
ments that were repeated six times at each concentration 
of SOX. Experiments were conducted with a slit width 
of 5 nm with the excitation (Aex) and emission (Aem) wave- 
lengths used for the various substrates as follows: dsDNA/ 
dsDNA-5'P (Aex: 492 nm, Aem: 515 nm), ssRNA-5'P 



Table 1. Oligonucleotides used in assays and crystallization trials 



dsDNA 


5' 


-GGGGATCCTCCCAGTCGACC - 3 ' 






FAM- 3 ' - CCCCTAGGAGGATCAGCTGG- 


5' 


dsDNA-5'P 


P- 


- 5' - GGGGATCCTCCCAGTCGACC - 3 ' - 


-FL 




3' 


- CCCCTAGGAGGATCAGCTGG - 5 ' 




dsRNA-5'P 


P- 


■ 5 ' - UGGUUUAC AUGUUC C AAUAUU - 3 


'-FL 




3' 


- AC C AAAUGUAC AAGGUUAUAA - 5 ' 




ssRNA-5'P 


P- 


■ 5 ' - UGGUUUAC AUGUUC C AAUAUU - 3 


'-FL 


Crystallization 








CAdsDNA 


5 ; 


-GGGGATCCTCCCAGTCGACC - 3 ' 






3' 


-CCCCTAGGAGGATCAGCTGG- 5 7 





(Aex: 493 nm, Aem: 520 nm), dsRNA-5'P (Aex: 494 nm, 
Aem: 521 nm). The changes in anisotropy following titra- 
tion of KSHV-SOX into the various DNA or RNA oligo- 
nucleotides (whose anisotropics varied between 0.04 and 
0.1) were used to calculate the affinity constants. All data 
were fitted to a one site binding equation. 

Crystallography 

KSHV SOX at a concentration of 4 mg/ml was incubated 
at 4°C for 3 h with CAdsDNA (Table 1) in a protein:DNA 
ratio of 1:1.2 in an incubation buffer comprising 50 mM 
Tris-HCl pH9.0, 50 mM NaCl and 1.6 mM MgCl 2 . The 
complex was then concentrated to 8 mg/ml (KSHV SOX) 
in a 0.5 ml Millipore 3kDa cut-off centrifugal concentra- 
tor and used to set up hanging drop crystallization trials in 
which 2 ul drops containing equal volumes of protein- 
DNA complex and mother liquor solution were incubated 
at 16°C. Single diffraction quality crystals of SOX grew 
overnight in 180mM ammonium formate, 19% PEG 
3350. These were then cryoprotected in mother liquor 
solution containing 22% ethylene glycol and flash frozen 
in liquid nitrogen. 

Data were collected on beamline ID-23-1 at the ESRF 
on a single flash frozen crystal to 2.5 A that were subse- 
quently processed and scaled using MOSFLM and 
SCALA from the CCP4 suite (29). The structure was 
solved by Molecular Replacement using MOLREP (30), 
manually re-built in COOT (31) and refined using 
PHENIX (32) where cycles of TLS refinement were also 
performed. The TLS groups were defined according to the 
suggestions of the TLS motion determination server 
(33,34). The final model, comprised of a single protein 
monomer, 38 nt, 2 Mg 2+ ions and 127 water molecules 
had an R cryst of 20.89% and 7? free of 26.52%. All stereo- 
chemical parameters were well within the expected ranges 
for a structure at this resolution (Table 2). The co- 
ordinates and structure factors have been deposited in 
the protein data bank under the accession code 3POV. 

RESULTS 

Screening of duplexes for crystallization 

Initial studies using linear duplexes containing a centrally 
located CA mismatch revealed that the presence of the 
mismatch had no effect on either the binding affinity or 
cleavage activity when compared with those substituted 
with a G:C base pair in the analogous position (data not 
shown). In contrast, those labelled with a 5' phosphate 
were highly susceptible to degradation while those that 
were lacking the modification showed little evidence of 
cleavage (Figure 1A). The 5' phosphate substitution, 
however, had only a modest impact on overall binding 
affinity based on the results of fluorescence polarization 
anisotropy assays that indicated similar modes of binding 
for the unmodified or modified duplexes dsDNA and 
dsDNA-5'P, respectively (Table 1 and Figure IB). The 
duplex CAdsDNA containing the CA mismatch but 
lacking the 5' phosphate was therefore used for crystalliza- 
tion trials due to its apparent lack of susceptibility to 
cleavage. 
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Table 2. Data collection and refinement statistics 



(A)] 



Space group 
Unit cell [a, b, c 
Resolution (A) 
Total no. of reflections 
No. of unique reflections 
Redundancy 
Completeness (%) 
<I>/<g(I)> 

n a 

Emerge 

Refinement 

No. of protein atoms 
No. of DNA atoms 
No. of Mg 2+ ions 
No. of solvent atoms 

^work A^cryst (%) 

Maximum likelihood co-ordinate error (A) 

Overall B-factors (A 2 ) d 
Deviations from ideal stereochemistry 

RMSD bonds (A) 

RMSD angles (°) 

RMSD B-factors (A 2 ) 6 

Wilson B-factor (A 2 ) 
Ramachandran plot analysis f 

Most favoured (%) 

Additionally allowed (%) 

Disallowed (%) 



P2i2!2! 

54.52, 67.66, 174.56 

63.1-2.5 (2.6-2.5) 

89560 

22693 

3.9 (4.0) 

98.9 (99.4) 

8.7 (2.6) 

9.3 (53.3) 

3467 

775 

2 

127 

20.89/26.52 
0.38 

64/55/69.0/108/51.4/79.0 

0.002 
0.653 
8.0 
48.6 

95.85 

4.84 

0.69 



Values in parentheses are for the highest resolution shell (2.6-2.50 A). 
a R merge = S(|Ii - <I>|)/S(<I>), where the sum is calculated over all 
observations of a measured reflection (Ij), and <I> is the mean inten- 
sity of all the measured observations (10 . 

b R cryst = S (|F obs -F calc |)/E (F obs ), F obs are the observed structure factor 

amplitudes, and F calc those calculated from the model. 

c Rf ree is equivalent to R cry st but where 5% of the measured reflections 

have been excluded from refinement and set aside for cross-validation 

purposes. 

d Average B-values for all atoms/protein/Mg 2+ /DNA/solvent/formate 
ion/. 

e RMSD in B-factor between covalently bonded main chain atoms and 
side chain atoms. 

f Ramachandran plot analysis was from molprobity (45). 



Overall structure 

The structure of the KSHV SOX-DNA complex was 
determined by molecular replacement using the protein 
co-ordinates of the native KSHV-SOX enzyme [PDB 
entry 3FHD (see 'Materials and Methods' section)]. 
Initial inspection revealed density for the DNA in the 
vicinity of the proposed DNA binding region or 
'canyon' that is located in the interface between the N- 
and C- terminal lobes of the molecule (Figure 2A). 
Despite containing a centrally located, wobble base 
paired CA mismatch (A29:C12), there is no discernable 
distortion of the duplex that adopts a standard B con- 
formation (Supplementary Figure SI A). Out of the 
40 nt, all could be assigned although Gl and C40 were 
only partially ordered. Although nucleotides 2-13 in 
strand 1 and 31-39 in strand 2 are stabilized by a 
variety of protein-DNA interactions (described in the 
6 Protein-DNA interactions' section), those remaining do 
not interact with the canyon of KSHV-SOX and form few 
interactions with symmetry related protein molecules in 
the crystal lattice (Supplementary Figure SIB). As a con- 
sequence, these nucleotides are considerably less well 



WT nuclease 
lOmM Mg 2+ 



+ 
+ 



+ 
+ 



B 




dsDNA/dsDNA 5'-P 



■ dsDNA 
® dsDNA-5'P 



20 40 60 

SOX Concentration (jxM) 



Substrate 


K d (JIM) 


dsDNA 


12 ±0.7 


dsDNA-5'P 


6 ±0.6 



Figure 1. (A) Exonuclease assays involving the DNA substrates 
dsDNA and dsDNA-5'P illustrate that cleavage is only observed on 
duplexes in which one strand has been substituted with a 5 ; phosphate 
group. (B) Fluorescence anisotropy assays indicate that both duplexes 
bind to SOX with similar affinities (12 uM for dsDNA and 6uM for 
dsDNA-5'P) suggestive of largely equivalent modes of binding. 



ordered consistent with the overall higher B-values and 
breaks in the phosphodiester backbone observed. Aside 
from these regions, the electron density is of good 
quality overall (see Supplementary Figure S2A). 

Protein-DNA interactions 

Seven conserved motifs are essential for DNase activity in 
the alkaline exonucleases (Figure 2B) (14). Of these, 
residues within motifs III, IV and VI, which together 
form the canyon in the SOX molecule, interact directly 
with the duplex (Figure 2C) along with those in a loop 
between motifs IV and V (residues 315-320). This region 
maps to the second SOX nuclear localization sequence 
(NLS2) identified in BGLF5 and is highly conserved 
among the y-herpesvirus homologues (25,35). When 
compared with the native structure, the DNA binding 
motifs undergo modest remodelling on association with 
DNA (the RMSD calculated for all equivalent Coc atoms 
is 1.5 A); the only exception being residues 315-318 of 
NLS2 that are absent in the native structure, but fully 
ordered in the complex (Supplementary Figure S2B). 

Consistent with the lack of sequence preference ex- 
hibited by the SOX homologues, the majority of 
protein-DNA contacts involve the phosphodiester 
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backbone and mostly occur in the minor groove. There are 
only two areas of specific contact that cluster at the 5'-end 
of strand 1 (close to the active site) and a more central 
region of the duplex involving strand 2 predominantly 
(Figure 3), although a plethora of van der Waals 



interactions involving the three DNA binding motifs and 
NLS2 are observed. Motif III features heavily in the rec- 
ognition of the G3, G4 and A5 phosphate groups towards 
the 5'-end of strand 1. In this region, the group of 
K246, located within the strictly conserved PD-(D/E)XK 




B 




Motif I 

KSHV 122 VERATSSQSLNPVWDALRDGI ISSSKF 148 

BHV4 122 LESITREQHLSTLWDTLRNGI ISSSKM 148 

MHV68 120 IEALTRGQQDNELWDILRDGMISSSKF 146 

BGLF5 106 VESMTRGQSENLMWDILRNGIISSSKL 132 

PLHV2 116 I ENMT RGQ S EN I LWE I LRDGV I S S S KL 142 

UL12 217 YEAATQNQADCQLWALLRRGLTTASTL 244 

Motif III 



KSHV 243 YEIKCRFKYTFAKME 257 
BHV4 243 YEIKCRFKYMFSKSE 257 
MHV68 239 YEIKSRYKYQFSKSE 253 
BGLF5 224 YEIKCRFKYLFSKSE 238 
PLHV2 238 YEIKCRYKYLFSKSE 252 
UL12 362 YEVKCRAKYAFDPMD 377 



Motif V 

KSHV 335 NSTTESDVYVL 345 
BHV4 335 NSCVESDVFIL 345 
MHV68 333 NMYQQSTVYIL 343 
BGLF5 316 NRGVESMLYVM 326 
PLHV2 331 NCYTESKVIIL 341 
UL12 455 NSGWSEVLLF 465 

Motif VII 



KSHV 
BHV4 
MHV68 
BGLF5 
PLHV2 
UL12 



433 IPTLLIVTPVYF 444 
431 IPVMIIITQVRI 446 
433 IPVALIITPVRI 444 
417 IPVAVLVTPWL 428 
430 IPVLLIITPVFI 441 
565 VPIALIITPVRI 576 



Motif II 

206 GFMQ.SPQNGI FGVSLD 221 

206 GFMQSPLDGI FGVSLD 221 

204 GFLPSAIDGI FGVSLD 219 

188 GFMISPTDGI FGVSLD 203 

200 GFMLSP I DGI FGVSLD 215 

325 GVLMDGHTGMVGASLD 340 

Motif IV 



281 FFYSISKPAVEYVGLGKLPSESDYLVAYDQEW 311 
281 FILSVSRPAVEFVAPGGIPSERDFLLTHGLEW 311 
279 FINCISRPAVEYVPHGKLPSESDYLIIHSMNW 309 
261 FINSIARPTVEYVPDGRLPSEGDYLLTQDEAW 291 
276 F I SS I PKPAVEHVPRGRVPTQNDYLLS FDK I W 306 
415 F I RS I PKP S VRYFAPGRVPGPEE ALVTQDQAW 419 



Motif VI 

365 VFVNVRHSYFYQVLLQSSIVEEYI 388 

365 LFVNPKHTYFFQVLLQYKWESYI 388 

363 VFVNPAHNYFYQVALQNMWQDYI 386 

346 I F INPRHNYF YQVLLQYKIVGDYV 369 

342 LY INPRHAY Y YQILLQYKIVTNY I 365 

491 VFANPRHPNFKQILVQGYVLDSHF 514 

Bridge 

154 QQNTSKK I FSPWP I TNNHFVAGPLAFG 180 
154 QRKTSKK I FEPWP I KNN YYVAS PLAFG 180 
152 QHS I NKKLFNPQP I KVNHYFAGPLAFG 17 8 
138 N — GPTKVFEPAPISTNHYFGGPVAFG 162 
148 QQAADNK I FDPLP I QKNHYVAS P I AFG 17 4 
250 GPCFSPQWLKHNASLRPDVQSSAVMFG 276 




Figure 2. (A) Cartoon of KSHV SOX and corresponding 2mFo-DFc omit map density for the DNA (magenta, contoured at la) bound at the 
interface or 'canyon' between the N- and C-terminal lobes (green and yellow, respectively) of the SOX molecule. The density for nucleotides at the 
free end of the duplex is more fragmented due to a lack of stabilizing protein-DNA interactions. (B) Sequence alignments of the conserved DNase 
motifs and bridge regions in the y-herpesvirus SOX homologues from Human herpesvirus 8 (KSHV), Bovine herpesvirus 2 (BHV2), Mouse 
herpesvirus 68 (MHV68), Epstein-Barr virus (BGLF5), Porcine lymphotropic herpesvirus 2 (PLHV) and UL12 from the a-herpes virus HSV-1. 
The alignment was performed using CLUSTALW (http://www.ebi.ac.uk/Tools/clustalw2/index.html). (C) The locations of motifs II (pink), III (grey), 
IV (orange) and VI (blue) within the KSHV-SOX complex that out of the seven conserved sequences mediate the observed protein-DNA inter- 
actions or are involved in catalysis. The position of the partially ordered bridge (red) is also highlighted. The second SOX nuclear localization 
sequence (NLS2, green), that has a role in DNA binding is also shown. 
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Figure 3. Schematic diagram of protein-DNA hydrogen bonds 
stabilizing the complex. Those depicted by dashed lines and asterisks 
are mediated by water molecules and those with unbroken lines, main 
or side chain groups. CI 2 and A29 (marked with red asterisks) corres- 
pond to the CA mismatch. 



sequence, interacts with OIP of G3 via a water molecule 
while the main chain NH groups of K250 and F249 (both 
highly conserved) donate hydrogen bonds to OIP of G4 
(Figures 3 and 4A). The carbonyl oxygen of K250 also 
contributes a water mediated hydrogen bond to OIP of 
A5. Y373, located in motif VI, donates an additional 
hydrogen bond to OIP of G3 via its hydroxyl group. In 
contrast, recognition of the central duplex region is largely 
dependent on residues K318, R319 and K320 in NLS2, 
L297 and E300 in domain IV and R370 in motif VI 
(Figures 3 and 4B). Hydrogen bonds are donated to 
OIP and 02P of G33 by the side chain NH1 and main 
chain NH groups of R370 and E300, respectively. The 
moiety of K318 donates a hydrogen bond to 02P of A32 
that is further stabilized by a water mediated hydrogen 
bond involving the carbonyl oxygen of L297. A32 is add- 
itionally recognized by a hydrogen bond donated by the 
peptide NH group of R3 19 to OIP. The side chain moiety 
of R319 is observed to protrude into the minor groove 
where it is able to mediate a range of van der Waals inter- 
actions with C12, C31 and A32 (Figure 4B). Within this 
region, K320 additionally contributes direct and water 
mediated hydrogen bonds via its main chain NH and 
carbonyl oxygen moieties, respectively, to OIP of A13. 
No protein-DNA interactions are observed for nucleo- 
tides G14-G20 of strand 1 and G21-G30 of strand 2. 

Active site configuration and cleavage geometry 

Essential to the endo/exonuclease activities of the viral 
SOX homologues are residues within the conserved PD- 
(D/E)XK sequence where the D and consecutive D/E 
residues are located within motifs II and III, respectively, 
and correspond to D221 and E244 in KSHV SOX (25). In 
the native structure, these together with El 84, which is 
also strictly conserved in the SOX homologues, contribute 
ligands to a bound Mg 2+ ion (Mgl) via their carboxylate 
groups (15). Mgl, though slightly shifted and less well 
ordered, is also present in our complex and, as a result 
of this re-positioning, falls within hydrogen bonding 
distance of the N£ group of K246 (Figure 4C). Since the 
predominant activity of SOX is that of a 5-3' exonuclease, 
it has been suggested that the scissile bond, located at the 
3'-end of the 5' nt, should be within hydrogen bonding 
distance of Mgl. This is inconsistent with the position of 
G2 in our complex, the site of exonucleolytic cleavage, 
whose partially ordered phosphate group is angled away 
from the catalytic region and >7A from Mgl. 

While cleavage at G2 based on our structure would be 
highly unlikely, G3 is more favourably juxtaposed with 
respect to the catalytic region. In our complex, the phos- 
phate group of G3 forms water mediated hydrogen bonds 
with K246 of the PD-(D/E) XK motif as previ- 
ously described and also contacts a second Mg 2+ ion 
(Mg2, present at a level of 5a in mFo-DFc omit maps, 
Figure 4A) absent in the native KSHV structure. Mg2 is 
located a distance of ~5 A from Mgl and is further liganded 
to Oel of El 84, a water molecule and the peptide NH group 
of C247. The presence of two metal ions in the catalytic 
region is suggestive of a classical two metal S N 2 mechanism 
[reviewed in (36,37)]; however, the phosphate group of G3 is 
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Figure 4. DNA recognition. (A) Stabilization of the duplex in the region closest to the catalytic centre exclusively involves strand 1 and is dominated 
by residues in motif III. The protein-DNA contacts are a combination of water, main and side chain mediated hydrogen bonds involving residues 
K246, F249, K250 and Y373 (colour coded according to their motif locations as in Figure 2C) and the phosphate groups of G3, G4 and A5 together 
with a range of van der Waals interactions. mFo-DFc omit map density (red, contoured at 3a although present at 5a) attributable to a second metal 
ion (green) was also identified that is absent in the native KSHV structure. (B) The protein-DNA interactions observed for the second region of 
contact involves a more mixed distribution of residues over the conserved motifs compared with the first. Here residues R370 (motif VI), K318 and 
K320 (NLS), E297 and E300 (motif IV) mediate water, main and side chain hydrogen bonds with G31, A32 and G33 of strand 2 and A13 of strand 
1. The side chain moiety of R319 protrudes into the minor groove where it makes a range of van der Waals contacts with C12, G31 and A32. (C) 
Mgl and surrounding co-ordination sphere in the wild-type (left) and DNA bound (right) KSHV SOX structures. The loss of tight ligands 
contributed by El 84 and a water molecule (disordered in our complex) results in a weaker association as a result of a shift of Mgl. In this 
position, Mgl is within hydrogen bonding distance of the N^ group of K246. (D) Schematic diagram of the classical two metal nucleophilic 
attack mechanism. The oxygen atom of the activated water is shown in red. 
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not in the canonical position for endonucleolytic processing. 
In most examples of two metal mechanisms, the metal ions 
are co-ordinated between the non-bridging oxygens of the 
scissile phosphate (Figure 4D) and are further liganded by a 
highly conserved aspartate. In this configuration, one metal 
ion acts to deprotonate a water molecule for generation of a 
hydroxyanion attacking nucleophile, while the other stabil- 
izes the resultant pentavalent transition state intermediate. 
Even though Mgl and Mg2 are both liganded to El 84 that 
could substitute for an aspartate, only Mg2 is in direct 
contact with the o phosphate group of G3 (Figure 5A). 
Mgl, located ~7A from G3, would therefore be unable to 
generate a hydroxyanion for nucleophilic attack of the G3 
P-03 bond. 

Although Mgl is proposed to act as a hydroxyanion 
generator, structural and mutagenesis studies have sug- 
gested that highly conserved amino acid residues such as 
lysine, tyrosine or serine might fulfil this role [reviewed in 
(37)]. Our complex suggests that in principle, K246 could 
activate the water molecule bound to OIP of G3 and has been 
shown to have a key catalytic role in support of this (38). 



However, the so generated nucleophile would be poorly 
positioned for inline attack. Alternatively, the hydroxyl 
group of Y373 that is also hydrogen bonded to OIP of 
G3 could be deprotonated to form an oxyanion that could 
in turn generate a phosphotyrosyl covalent intermediate. 
This can be rejected on the basis of site directed mutagen- 
esis studies that show Y373 is a non-essential residue (data 
not shown). Although there is still the possibility that 
KSHV-SOX might utilize a novel variant of a two metal 
mechanism in which G3 could be well placed for catalysis, 
the fact that the duplex used in our crystallization studies 
remains intact and that it fails to undergo processing in 
our in vitro assay strongly suggests that G3 is intractable 
to cleavage, similar to G2. This is strengthened by the 
observation that endonucleolytic processing appears to 
be restricted to plasmid substrates (7,39) and has not 
been reported for linear duplexes. The inability of linear 
duplexes lacking a 5' phosphate to be efficiently targeted 
for exonucleolytic processing or undergo endonucleolytic 
cleavage can therefore be explained by the association of 
these oligonucleotides with KSHV-SOX resulting in 





E184 



Figure 5. (A) Stereofigure showing the active site geometry in relation to G3, the nucleotide whose phosphate group is in contact with K246 of the 
PD-(D/E)XK and the second metal ion, Mg2. The relative juxtaposition of G3 to Mgl and a possible catalytic water molecule are inconsistent with a 
configuration that would promote cleavage at this site based on the classical two metal mechanisms. (B) Rotation of Gl out of the duplex stack such 
that a modelled 5' phosphate group would be able to hydrogen bond to SI 46 and S219 (in the rotamer conformations observed in the native KSHV 
structure) that form a conserved 'serine cluster' (brown), is sufficient to align the scissile phosphate G2 with Mgl and Mg2 for cleavage in a near 
classical two metal mechanism configuration. In this conformation, the two metal ions straddle the stereospecific oxygen OIP and the water molecule 
bound to K246 (A, light blue sphere) is well placed for inline attack. 
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potential scissile phosphate groups being inappropriately 
positioned for cleavage as observed in our complex. 

5' phosphate recognition: the key to exo and 
endonucleolytic cleavage 

Although the relative juxtapositions of the scissile phos- 
phate, metal ions and attacking nucleophile are important 
factors for cleavage, substrate stabilization is also essen- 
tial. For the exonucleolytic reaction, this would addition- 
ally involve recognition of the 5' phosphate. A putative 5' 
phosphate binding site, encompassing residues SI 44, SI 45 
and SI 46 of motif I together with S219 of motif II, has 
been proposed for the SOX homologues based on the 
findings that these residues are associated with a 
sulphate ion and anion in the native KSHV and BGLF5 
structures, respectively (15,16) (we also note a formate ion 
bound in this region of our complex). The importance of 
this 'serine cluster' is further underpinned by mutagenesis 
studies where substitution of S219 in both the HSV-1 and 
baculovirus SOX homologues results in significant attenu- 
ation of exo and endonuclease activities (7,39). In order 
for a modelled 5' phosphate group on Gl in our complex 
to contact the cluster for exonucleolytic cleavage, the nu- 
cleotide would have to be rotated towards motif 1 from its 
position stacked against G2 (implied by the partially 
ordered density) to enable the formation of hydrogen 
bonds with SI 46 and S219. These in turn, would only be 
required to adopt the side chain rotamers observed in the 
native KSHV structure (Figure 5B). Surprisingly, 
manipulating Gl into this position results in optimal 
placement of G2 as the scissile phosphate. In this config- 
uration, Mgl and Mg2 have the classical geometry in 
which both are liganded to the stereospecific, non-bridging 
oxygen OIP. In addition, the water molecule hydrogen 
bonded to both K246 and OIP (though out of the range 
for contacts with either Mgl or Mg2), is now near opti- 
mally positioned as a nucleophile given an inline attack 
angle of 147°. These results suggest that for exonucleolytic 
processing, stabilization of the 5' phosphate group is key 
to the correct positioning of the scissile bond for cleavage. 

While mutagenesis studies focused on S219 have shown 
that the serine cluster has a more prominent role in 
exonucleolytic cleavage, as evidenced by the higher level 
of attenuation compared with that observed for 
endonucleolytic processing (7), it is nonetheless key to 
both reactions. Based on our model for exonucleolytic 
cleavage, the association of a phosphate group with this 
site in the context of a continuous duplex could only occur 
if the helix were locally denatured. In keeping with this, 
the recent studies of Mikhailov et al. demonstrate that 
endonucleolytic processing is only observed on 
single-stranded minicircle or supercoiled plasmid sub- 
strates that, in an attempt to relieve torsional stress, 
develop local regions of single-stranded DNA that are 
the likely targets for cleavage (7,39). Though difficult to 
speculate on how supercoiled/mini-circle plasmids or 
indeed the natural substrate would be accommodated, 
comparison of our complex with the BGLF5 SOX homo- 
logue strongly implicates the bridge as being essential to 
this process. Bridge architectures, based on studies 



involving the flap endonucleases, were originally 
proposed to have a pivotal role in exonucleolytic 
cleavage by inducing strand separation for targeting of 
the scissile strand. It has further been suggested that 
they function as DNA 'clamps' stabilizing specific types 
of DNA architecture (22,40,41). In both our complex and 
native KSHV SOX, the bridge (that spans amino acid 
residues 154-180, Figure 2B) is only partially ordered. 
The analogous region, however, is completely intact in 
the BGLF5 SOX mutant D203S that shares -42% 
sequence identity with the KSHV enzyme. 

Superposition of the BGLF5 D203S co-ordinates 
with those of our complex reveals that while both struc- 
tures have a high degree of similarity overall (see 
Supplementary Figure S2C), the duplex could not be 
accommodated in the canyon of the BGLF5 mutant 
without substantial re-modelling of the 5'-end of strand 
1. This would be due to severe steric clashes between 
bridge residues G156 to VI 59 in BGLF5 (equivalent to 
A 174 and LI 77 in KSHV SOX, absent in both our 
complex and the native enzyme) and all base pairs 5' to 
G4 (Figure 6A). These unfavourable contacts arise from 
an ~5.5A lateral movement of residues A 160 and F161 
(equivalent to A 178 and F 179 in KSHV SOX) towards the 
C-terminal lobe of the SOX molecule, away from the 
active site. The nucleotides 5' to G4 could only be favour- 
ably accommodated in the context of the BGLF5 bridge 
conformer if they were no longer stacked in a regular 
duplex but instead situated in largely single-stranded 
regions of DNA. The conformation of the bridge 
observed in the D203S BGLF5 mutant, therefore, seems 
more consistent with endonucleolytic cleavage in which 
major local distortion of the DNA appears to be 
required for recognition in contrast to the exonuclease 
reaction where only relatively minor changes focused on 
the 5'-nt are required. This suggests that conformational 
re-modelling of the bridge may be the trigger for switching 
SOX between endo and exonucleolytic cleavage modes. 

RNase activity and host shutoff 

Although it has been proposed that host shutoff results 
from subversion of host mRNA degradation pathways, 
recent data have shown that in EBV, BGLF5 has an in- 
trinsic, Mn 2+ dependent RNase activity. It has been 
demonstrated that this activity involves the catalytic 
residues utilized for DNA processing (16). To establish 
whether KSHV SOX also shares this intrinsic activity, 
we performed a similar RNase assay using a uracil rich 
ssRNA 20-mer (Figure 6B). Our results show unequivo- 
cally that KSHV SOX possesses an RNase activity that 
cannot be attributed to contamination since experiments 
performed with the D221S and E244S mutants lacking 
DNase activity but competent in DNA binding (data 
not shown) reveal its complete abolition. These results 
are similar to those reported for BGLF5 confirming that 
both DNA and RNA processing utilize the same catalytic 
machinery and is consistent with the in vivo studies of 
Glaunsinger et al. (25). Unlike BGLF5, however, the 
RNase activity of KSHV SOX does not appear to be 
Mn 2+ dependent since similar activities are observed in 
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affinities (K d 's of 87 and 95 uM, respectively) that is sug- 
gestive of engagement with only one strand. Interestingly, 
the affinity for RNA is considerably weaker than for 
DNA (-15- to 16-fold weaker when ssRNA-5'P and 
dsRNA-5'P are compared with dsDNA-5'P, Figure 6C). 

In addition to those involved in catalysis, residues 
within the N-terminal lobe of SOX (Figure 7A) have 
also been shown to have an essential role in host shutoff 
(25). To ascertain whether these non-catalytic shutoff 
residues contribute to the RNase activity of KSHV 
SOX, the mutants T24I, A61T, P176S, V369I, D474N 
and Y477STOP, shown to significantly attenuate or 
abolish global degradation of host mRNA, were generated 
(with the exception of T24I that was insoluble) and 
screened using our RNase and DNase assays (Figure 7B 
and C). All mutants exhibited wild- type RNase and 
DNase activities. Our results, therefore, suggest that 
host shutoff in KSHV cannot be entirely explained by 
the intrinsic RNase activity of KSHV SOX which would 
in turn support the view that SOX mediates host shutoff 
by interacting with as yet unidentified cellular factors. 




SOX Concentration (|xM) 



Substrate 


Kd (|IM) 


ssRNA-5'P 


87 ± 3.3 


dsRNA-5'P 


95 ± 3.7 


dsDNA-5'P 


6 ± 0.6 



Figure 6. (A) Superposition of the KSHV DNA complex with the D203S 
BGLF5 mutant reveals a number of severe steric clashes involving residues 
G156-V159 (BGLF5) at the C-terminus of the bridge and all base pairs 5' 
to G4. This is as result of repositioning of residues A160 and F161 that 
move ~5.5 A towards the C-terminal lobe of the SOX molecule. Only a 
denatured duplex could interact with the serine cluster (highlighted in 
green) and be favourably accommodated by the bridge in this alternative 
conformation. (B) RNase assay of wild-type KSHV SOX in the presence 
of 10 mM MgCl 2 or MnCl 2 . (C) Fluorescence anisotropy binding assays of 
SOX involving single- (ssRNA-5'P) and double-stranded (dsRNA-5'P) 
RNA oligonucleotides. The results obtained for dsDNA-5'P are included 
for comparison. 

assays performed in the presence of Mg 2+ alone at the 
same concentration. Fluorescence Anisotropy studies 
(Figure 6C) further reveal that SOX is able to associate 
with single- and double-stranded RNA with similar 



DISCUSSION 

We report the first crystal structure of a viral alkaline 
exonuclease in complex with a DNA duplex. Our 
complex has confirmed the involvement of the D-(D/ 
E)XK motif and residues in conserved motifs III, IV, VI 
(previously implicated in DNase activity) and NLS2 in 
DNA binding. Although the duplex used in our studies 
did not undergo processing, the structure has provided key 
insights into the cleavage geometry utilized by the SOX 
nucleases. To this end, we have identified two Mg 2+ ions 
bound within the catalytic region and shown that align- 
ment of the putative 5' phosphate binding site (serine 
cluster) with the 5' phosphate of the substrate results in 
a near classical two metal ion cleavage configuration 
(36,37,42). A potential nucleophilic water molecule has 
also been identified based on this model that has near 
perfect geometry for inline attack. Interestingly, the asso- 
ciation between this water molecule and K246 together 
with its overly long distance from either of the two 
metal ions is highly suggestive of K246 acting as the 
general base. Although this residue is strictly conserved 
among the SOX homologues and the equivalent lysine in 
the endonuclease MutH binds to a nucleophilic water 
molecule in a similar configuration (43), a major 
drawback in this assignment lies in its positive charge at 
pHs < 10.5. This would prevent abstraction of a proton 
and subsequent generation of an attacking nucleophile at 
the optimal pH of 8.5-9.0. The juxtaposition of K246 to 
Mgl in our structure, however, would substantially lower 
its pKa rendering it capable of generating a hydroxy anion. 
Alternatively, it is possible that the two metal ions 
approach each other more closely than the 5 A observed 
in our complex since 4 A has been cited as the optimum 
separation distance for maximal transition state stabiliza- 
tion (37). Following such movement, Mgl might be well 
placed to assist in nucleophile generation. 
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Figure 7. (A) Cartoon showing the locations of the non-catalytic host shutoff residues (red) within the complex (the DNA is shown in light blue). 
(B) RNase assays of all mutants (excluding T24I) known to reduce/abolish host shutoff. (C) Plasmid exonuclease assays of catalytic and non-catalytic 
shutoff mutants. 



In addition to addressing the cleavage geometry, our 
structure has also shown that endonucleolytic processing 
could only occur on highly distorted duplexes containing 
regions of single-stranded DNA in keeping with the 
reported biochemical studies. They also suggest that 
these structures are likely to be stabilized by the bridge 
and that conformational re-modelling of this region may 
be the trigger for their recognition. Although bridge or 
arch architectures were originally thought to function as 
strand separators forming an aperture through which a 
single DNA strand could be threaded for high 
processivity, there is no evidence of this in our complex 
or that it would be required for catalysis. It appears that 
exonucleolytically processed substrates are ostensibly 
bound as duplexes [as suggested by Dahlroth et al. (15)], 
and that the only re-arrangements required are those ne- 
cessary for recognition of the 5' phosphate group. This is 



in good agreement with the similar K d 's observed for the 
duplex used in our structural studies and the same duplex 
substituted with a 5' phosphate group. It is also difficult to 
envisage how a threading mechanism would address the 
endonuclease reaction satisfactorily with stabilization of 
highly distorted DNA structures by the bridge appearing 
a more likely scenario. This is in keeping with the bridge 
acting as a DNA 'clamp' that has also been proposed (22). 
In order to fully characterize the function of the bridge 
and further analyse the significance of the conformational 
states observed, additional biochemical and structural 
studies will have to be performed using more physiological 
branched substrates although there is still uncertainty 
regarding their exact nature. 

While the DNase activity of the y-herpesvirus SOX 
homologues is important in the maturation of virus par- 
ticles, it has also been shown to compromise the integrity 
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of the host genome in EBV (44). This combined with the 
rapid and global degradation of host mRNA will ultim- 
ately culminate in cell death. Although the exact mechan- 
ism of host shutoff remains to be determined, it was 
initially thought not to involve an intrinsic RNase 
activity. Recently, it was reported that the EBV SOX 
homologue BGLF5 possesses an in vitro RNase activity 
(16). In the current study, we were able to characterize a 
similar but non-identical metal ion-dependent RNase 
activity that interestingly was not attenuated by the 
non-catalytic mutations associated with shutoff. While 
RNase activity is clearly an important element of 
shutoff, our results indicate that SOX is not solely respon- 
sible for the phenomenon and that additional factors, 
most likely involving protein-protein interactions (as ini- 
tially suggested), are required for degradation of host 
mRNA. This is further supported by the relatively low 
affinity of SOX for RNA obtained from our fluorescence 
polarization studies that is suggestive of a requirement for 
an interacting partner to enhance binding and possibly 
increase processivity. Given the burden of disease caused 
by herpesviruses in general and KSHV specifically, under- 
standing the molecular basis underlying their intrinsic 
DNase and RNase activities, together with the identifica- 
tion and characterization of potential interacting partners, 
will provide alternative strategies for the design of novel 
therapeutics. Our KSHV-SOX-DNA complex and pre- 
liminary characterization of the KSHV SOX RNase 
activity are therefore important steps towards this goal. 
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